Learnability of Bipartite Ranking Functions
نویسندگان
چکیده
The problem of ranking, in which the goal is to learn a real-valued ranking function that induces a ranking or ordering over an instance space, has recently gained attention in machine learning. We define a model of learnability for ranking functions in a particular setting of the ranking problem known as the bipartite ranking problem, and derive a number of results in this model. Our first main result provides a sufficient condition for the learnability of a class of ranking functions F : we show that F is learnable if its bipartite rank-shatter coefficients, which measure the richness of a ranking function class in the same way as do the standard VC-dimension related shatter coefficients (growth function) for classes of classification functions, do not grow too quickly. Our second main result gives a necessary condition for learnability: we define a new combinatorial parameter for a class of ranking functions F that we term the rank dimension of F , and show thatF is learnable only if its rank dimension is finite. Finally, we investigate questions of the computational complexity of learning ranking functions.
منابع مشابه
Uniform Convergence, Stability and Learnability for Ranking Problems
Most studies were devoted to the design of efficient algorithms and the evaluation and application on diverse ranking problems, whereas few work has been paid to the theoretical studies on ranking learnability. In this paper, we study the relation between uniform convergence, stability and learnability of ranking. In contrast to supervised learning where the learnability is equivalent to unifor...
متن کاملOn Theoretically Optimal Ranking Functions in Bipartite Ranking
This paper investigates the theoretical relation between loss criteria and the optimal ranking functions driven by the criteria in bipartite ranking. In particular, the relation between AUC maximization and minimization of ranking risk under a convex loss is examined. We characterize general conditions for ranking-calibrated loss functions in a pairwise approach, and show that the best ranking ...
متن کاملAnomaly Ranking as Supervised Bipartite Ranking
The Mass Volume (MV) curve is a visual tool to evaluate the performance of a scoring function with regard to its capacity to rank data in the same order as the underlying density function. Anomaly ranking refers to the unsupervised learning task which consists in building a scoring function, based on unlabeled data, with a MV curve as low as possible at any point. In this paper, it is proved th...
متن کاملGrammar Optimization: The simultaneous acquisition of constraint ranking and a lexicon
1 Learnability and Phonology The rise of Optimality Theory has led to a renewed interest in the acquisition and learnability of phonology. However, much of the work in the eld has concentrated on the acquisition of the correct ranking of the universal constraint set, without recognizing the interdependence between selecting a ranking and selecting the correct underlying forms: Under the assumpt...
متن کاملOn Partitioning Rules for Bipartite Ranking
The purpose of this paper is to investigate the properties of partitioning scoring rules in the bipartite ranking setup. We focus on ranking rules based on scoring functions. General sufficient conditions for the AUC consistency of scoring functions that are constant on cells of a partition of the feature space are provided. Rate bounds are obtained for cubic histogram scoring rules under mild ...
متن کامل